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Description 

PIPELINE-BASED CIRCUIT WITH A 
POSTPONED CLOCK-GATING 
MECHANISM FOR REDUCING POWER 
CONSUMPTION AND RELATED DRIVING 

METHOD THEREOF 

Background of Invention 
[000 1 ] 1. Field of the Invention 

[0002] The present invention relates to a pipeline-based circuit. 
In particular, the present invention discloses a pipeline- 
based circuit utilizing a postponed clock-gating mecha- 
nism for reducing power consumption. 

[0003] 2. Description of the Prior Art 

[0004] A n accurate clock signal is a key factor for a logic circuit 
to perform a correct logic operation. That is, the clock 
signal is used to drive kernel circuit units such as counters 
and registers within the logic circuit, and a stable clock 



signal such as a clock signal generated from a crystal os- 
cillator always functions as a reference clock to arbitrate 
operations of the circuit units within the logic circuit. 
However, all of the circuit units within the logic circuit are 
not always active. When some of the circuit units enter an 
idle mode, these idle circuit units do not need to be driven 
by the clock signal continuously for performing related 
operations. If the clock signal is still inputted into the idle 
circuit units, power consumption of the logic circuit is in- 
creased unnecessarily. It is well known that the power 
consumption of the logic circuit is mainly generated from 
delivering the clock signal to these circuit units and en- 
abling these circuit units to run related logic operations. 
In order to reduce power consumption of the logic circuit 
such as a microprocessor, the clock signals are gated 
from triggering the idle circuit units. Therefore, unwanted 
power consumption is accordingly eliminated. In other 
words, the clock signal transferred to an idle circuit unit is 
first converted to be one signal with a fixed logic value 
("1" or "0"). Taking a clock signal that is a square wave for 
example, the logic value "1" corresponding to a high volt- 
age and the logic value "0" corresponding to a low voltage 
are alternatively switched. The clock signal is gated after 



the clock signal is converted to hold either the logic value 
"1" or the logic value "0". Because the logic circuit drives 
one internal circuit unit through a converted clock signal 
holding a fixed logic value, the operation associated with 
the circuit unit is blocked. Therefore, the total power con- 
sumption of the logic circuit is further reduced. The 
above-mentioned process is a well-known clock-gating 
mechanism. 

[0005] please refer to Fig.l and Fig. 2. Fig.l is a schematic dia- 
gram of a prior clock-gating circuit 10, and Fig. 2 is a tim- 
ing diagram of signals of the clock-gating circuit 10 
shown in Fig.l. The clock-gating circuit 10 has a con- 
troller 12 and a plurality of logic gates 14a, 14b, 14c. 
Each of the logic gates 14a, 14b, 14c performs the AND 
logic operation. The controller 12 includes a plurality of 
clock control units 13a, 13b, 13c respectively used for 
generating control signals 15a, 15b, 15c to corresponding 
logic gates 14a, 14b, 14c. In addition, a system clock 
generator 16 is capable of generating a clock signal 17 to 
the logic gates 14a, 14b, 14c. Then, the logic gates 14a, 
14b, 14c respectively output clock-gating output signals 
18a, 18b, 18c to corresponding logic units 20a, 20b, 20c. 
The above clock-gating output signals 18a, 18b, 18c are 



used to drive the logic units 20a, 20b, 20c. 
[0006] when the logic unit 20a enters the idle mode, the clock 
control unit 13a is activated to gate the clock signal 17 
through the control signal 15a. Please refer to Fig. 2. Dur- 
ing a period t0~t2, the control signal 15a holds the logic 
value "1" corresponding to a high voltage. Therefore, the 
clock signal 17 successfully passes through the logic gate 
14a. That is, the waveform of the clock-gating output sig- 
nal 18a is identical to the waveform of the clock signal 17, 
and the clock-gating output signal 18a drives the running 
logic unit 20a successfully. However, when the logic unit 
20a does not need to be activated during a period t2~t4, 
the clock control unit 13a outputs the control signal 15a 
with the logic value "0" corresponding to a low voltage. 
The clock signal 17 is gated through the logic gate 14a. 
That is, the clock-gating output signal 18a holds the con- 
stant logic value "0" during the period t2~t4, and the op- 
eration of the logic unit 20a is interrupted to reduce 
power consumption. During a period t4~t5 and a period 
t7~t8, the control signal 15a corresponds to the logic 
value "1" so that the clock signal 17 is inputted into the 
logic unit 20a again. During a period t5~t7, the logic unit 
20a does not need to be activated. Therefore, the control 



signal 15a then corresponds to the logic value "0" to gate 
the clock signal 17 from driving the logic unit 20a for re- 
ducing power consumption. 

[0007] Similarly, with regard to other logic units 20b, 20c, the 
clock control units 13b, 13c output the control signals 
15b, 15c corresponding to the logic value "0" to gate the 
clock signal 17 through the logic gates 14b, 14c when the 
logic units 20b, 20c do not need to be activated. For the 
logic unit 20b, the clock control unit 13b gates the clock 
signal 17 to reduce power consumption during periods 
t0~tl, t3~t4, t5~t7. For the logic unit 20c, the clock con- 
trol unit 13c gates the clock signal 17 to reduce power 
consumption during a period t5~t6. Please note that the 
operations associated with the logic units 20b, 20c are 
not repeated for simplicity. 

[0008] For the logic circuit, a pipeline structure, generally speak- 
ing, is adopted to improve processing efficiency. Please 
refer to Fig. 3, which is a block diagram of a prior art 
pipeline-based circuit 30. The pipeline-based logic circuit 
30 includes a plurality of processing units 32a, 32b, 32c, 
a pipeline control unit 34, and a clock-gating unit 36. 
Each of the processing units 32a, 32b, 32c includes a 
logic unit 38a, 38b, 38c and a buffer unit 40a, 40b, 40c. 



The logic units 38a, 38b, 38c are used to perform prede- 
termined logic operations respectively. For example, the 
logic unit 38a, 38b, or 38c can be an adder for doing bi- 
nary addition or a multiplier for doing binary multiplica- 
tion. 

[0009] The buffer units 40a, 40b, 40c corresponding to the logic 
units 38a, 38b, 38c are used to store calculation results 
outputted from the logic units 38a, 38b, 38c. Then, a cal- 
culation result currently stored in one logic unit is passed 
to a logic unit next to the current logic unit. The buffer 
units 40a, 40b, 40c can be prior art flip-flops. If the logic 
unit 38a is used to output a calculation result having a bit 
length equaling 64, the buffer unit 40 needs 64 flip-flops 
to store the calculation result. In addition, one clock sig- 
nal is necessary for controlling the buffer units 40a, 40b, 
40c to store the calculation results generated from the 
logic units 38a, 38b, 38c and controlling the buffer units 
40a, 40b, 40c to output the stored calculation results. 

[0010] The pipeline control unit 34 is used to control the opera- 
tion of the pipeline established by the processing units 
32a, 32b, 32c. As shown in Fig. 3, the pipeline control unit 
34 is capable of outputting control signals PA, PB, PC to 
control the processing units 32a, 32b, 32c. For example, 



an input data DATAJN is inputted into the processing unit 
32a. Therefore, the logic unit 38a starts processing the 
input data DATAJN according to a predetermined logic 
operation. After the predetermined logic operation is 
done, the pipeline control unit 34 generates the control 
signal PA according to current operating modes of the 
logic units 32a, 32b, 32b, and outputs the control signal 
PA to the logic unit 32a for activating the buffer unit 40a 
to store a calculation result generated from the logic unit 
32a. At the same time, the stored calculation result is 
passed to the next logic unit 32b. As mentioned above, 
one of the logic units 32a, 32b, 32c in the pipeline-based 
logic circuit 30 may not be used to process the input data 
DATAJN. For example, after the input data DATAJN has 
been processed by the logic units 32a, 32b, a branch may 
occur to terminate the process for the input data 
DATAJN. Therefore, the logic unit 32b does not need to 
pass its calculation result to the next logic unit 32c for 
following operations. For the input data DATAJN, any 
logic units following the logic unit 32b do not need to be 
activated. In other words, the related buffer units do not 
need to transfer calculation results stage by stage. There- 
fore, a prior art clock-gating mechanism can be adopted 



to reduce power consumption of the inactive buffer units. 
1 ] The clock-gating unit 36 is used to control the clock sig- 
nals inputted into the buffer units 40a, 40b, 40c posi- 
tioned in the corresponding processing units 32a, 32b, 
32c to achieve the goal of saving power. Generally speak- 
ing, the clock-gating unit 36 generates the clock signals 
CLK_GA, CLK_GB, CLK.GC inputted to the buffer units 
40a, 40b, 40c according to a system clock CLK_S and the 
control signals PA, PB, PC generated from the pipeline 
control unit 34. The control signals PA, PB, PC are deter- 
mined according to predetermined conditions. For in- 
stance, data transmission statuses associated with a bus 
and operating statuses of logic units function as the pre- 
determined condition used by the pipeline control unit 34 
to output the control signal PB. Please note that the pre- 
determined conditions for the processing units 32a, 32b, 
32c may differ from each other. For example, each of the 
control signals PA, PB, PC comprises a piping enable sig- 
nal for driving a corresponding logic unit to pipe its cal- 
culation result to the next logic unit, and a piping flush 
signal for nullifying the calculation result generated by the 
corresponding logic unit. Concerning the logic unit 32b, 
suppose that the control signal PB itself is a piping enable 



signal, and corresponds to three conditions A, B, C. That 
is, the three conditions A, B, C are used to determine 
whether the piping enable signal is outputted to make the 
processing unit 32b pipe its calculation result to the next 
processing unit 32c. The conditions A, B, C are related to 
the operating statuses of the logic units 32a, 32b, 32c, 
and the pipeline control unit 34 only uses the conditions 
A, B, C to set the control signal PB corresponding to the 
logic unit 32b. The pipeline control unit 34 is capable of 
determining if the piping enable signal is outputted to the 
logic unit 32b after the conditions A, B, C have been suc- 
cessfully determined. Therefore, the pipeline control unit 
34 needs to wait until all of the conditions A, B, C are de- 
termined. That is, the pipeline control unit 34 has to wait 
a longer period of time before generating the piping en- 
able signal for the processing unit 32b. When the control 
signal PB is used to drive the prior art clock-gating mech- 
anism, the above-mentioned delay time actually affects 
the operation of the clock-gating unit 36. The reason is 
described as follows. 
[0012] please refer to Fig. 4 and Fig. 5. Fig. 4 is an example 

schematic diagram of the clock-gating unit 36 shown in 
Fig. 3, and Fig. 5 is a timing diagram of signals running in 



the clock-gating unit 36 shown in Fig. 3. The clock-gating 
unit 36 has a logic gate 42 and an inverter 44. The logic 
gate 44 performs an NOR logic operation upon the clock 
control signal CLK_ENB and the clock signal CLK_S to gen- 
erate the clock signals CLK_GA, CLK_GB, CLK_GC for the 
processing units 32a, 32b, 32c. The clock control signal 
CLK_ENB is generated from a predetermined logic opera- 
tion upon the piping enable signal and the pipeline flush 
signal of each processing unit 32a, 32b, 32c. For exam- 
ple, suppose that the clock control signal CLK_ENB for the 
processing unit 32b is determined by the piping enable 
signal only. As mentioned above, when all of the condi- 
tions A, B, C correspond to the logic value "1", the clock 
control signal CLK_ENB is set by the logic value "1" at t4. 
As shown in Fig. 5, the clock control signal CLK.ENB does 
not have a transition from the logic value "0" to the logic 
value "1" until t4. With the processing performed by the 
clock-gating unit 36, the clock signal CLK_GB has a falling 
edge at t3, and corresponds to the logic value "0" during a 
period t3~t4. Then, the clock signal CLK_GB has a rising 
edge at t4, and the clock signal CLK_GB holds the logic 
value "1" after t4. If the processing unit 32b connected to 
the clock-gating unit 36 is triggered by rising edges of 



the clock signal CLK_GB, the clock-gating unit 36 should 
make the clock signal CLK_CB hold the logic value "1" af- 
ter tO to gate the clock signal CLK_S. However, because 
the clock control signal CLK_ENB is late-arrived as shown 
in Fig. 5, the processing unit 32b is triggered twice re- 
spectively at tO and t4. In other words, the clock signal 
CLK_GB leaves the logic value "0" at t4. Therefore, the 
clock signal CLK_GB having the rising edge at t4 is capa- 
ble of triggering the processing unit 32b, and as such, the 
clock signal CLK_GB cannot achieve the goal of reducing 
power consumption. 
[0013] Furthermore, a glitch is induced to affect the operation of 
the processing unit 32b. According to the prior art, the 
period t0~tl is defined to be a clock-gating hold time, 
and the period t2~t3 is defined to be a clock-gating setup 
time. In other words, the clock control signal CLK_ENB 
needs to be inputted before a falling edge of the clock 
signal CLK_S. Otherwise, the clock signal CLK_GB gener- 
ates the unwanted glitch during the period t3~t4. The un- 
wanted glitch induced for each of the clock signals 
CLK_GA, CLK_GB, CLK_GC likely results in the pipeline- 
based logic circuit 30 functioning incorrectly. 
Summary of Invention 



[0014] it j S therefore a primary objective of this invention to pro- 
vide a pipeline-based circuit utilizing control values to 
control the operation of the clock-gating mechanism so 
that whether or not a clock signal is gated is pre- 
determined by the control values before each stage of the 
pipeline-based circuit starts working. 

[0015] Briefly summarized, the preferred embodiment of the 

present invention discloses a pipeline-based circuit. The 
pipeline-based circuit has a plurality of logic operators 
cascaded to form at least a pipeline, a pipeline control 
unit for generating at least a control signal to each logic 
operator for controlling whether one logic operator needs 
to pipe data to a next logic operator, and a control value 
calculator for setting a control value of each logic opera- 
tor following a currently activated logic operator accord- 
ing to the control signals generated from the pipeline 
control unit. When each logic operator begins operating, 
the related control value is used to determine whether or 
not a clock signal piping data of the present logic opera- 
tor to the next logic operator is gated to reduce power 
consumption. 

[0016] it is an advantage of the present invention that the 

claimed pipeline-based circuit can prevent the possibly 



late-arrived control signal from affecting the operation of 
the clock-gating unit. That is, the claimed pipeline-based 
circuit is capable of gating clock signals in time for saving 
power successfully. In addition, the claimed pipeline- 
based circuit needs to implement additional second buffer 
units only. The circuit structure of the second buffer unit 
is simple, and the implementation is straightforward. 
Therefore, the control values can be easily applied to 
pipeline-based circuits having different pipeline struc- 
tures for accomplishing the same purpose of reducing 
power consumption. In addition, the postponed clock- 
gating mechanism avoids the degradation of pipeline 
clock speed. 

[0017] These and other objectives of the present invention will no 
doubt become obvious to those of ordinary skill in the art 
after reading the following detailed description of the pre- 
ferred embodiment, which is illustrated in the various fig- 
ures and drawings. 
Brief Description of Drawings 

[0018] pig.l is a schematic diagram of a prior clock-gating cir- 
cuit. 

[0019] pig. 2 is a timing diagram of signals of the clock-gating 
circuit shown in Fig.l. 



[0020] pig. 3 is a block diagram of a prior art pipeline-based cir- 
cuit. 

[0021] pig. 4 is a schematic diagram of a clock-gating unit shown 
in Fig. 3. 

[0022] Fig. 5 is a timing diagram of signals of the clock-gating 
unit shown in Fig. 3. 

[0023] Fig. 6 is a block diagram of a pipeline-based circuit ac- 
cording to the present invention. 
Detailed Description 

[0024] please refer to Fig. 6, which is a block diagram of a 

pipeline-based circuit 50 according to the present inven- 
tion. The pipeline-based circuit 50,which can be a micro- 
processor or a digital signal processor (DSP), has a plural- 
ity of processing units 52a, 52b, 52c, a pipeline control 
unit 54, and a control value generator 56. The processing 
units 52a, 52b, 52c are cascaded to establish a pipeline. 
Each of the processing units 52a, 52b, 52c has a logic unit 
58a, 58b, 58c, a clock-gating unit 60a, 60b, 60c, a first 
buffer unit 62a, 62b, 62c, and a second buffer unit 64a, 
64b, 64c. Taking the processing unit 52a for example, the 
logic unit 58a is used to perform a predetermined logic 
operation such as addition or multiplication. The first 
buffer unit 62a is used to store a calculation result out- 



putted from the logic unit 58a, and to pass the stored cal- 
culation result to the next processing unit 52b. The clock- 
gating unit 60 is used to control a clock signal inputted 
into the first buffer unit 62a according to an operating 
status (an "active"mode or an "idle"mode) of the first 
buffer unit 62a. If the first buffer unit 62a does not need 
to be driven by the clock signal, the clock signal is gated 
for reducing power consumption. In the preferred embod- 
iment, the second buffer units 64a, 64b, 64c store control 
values 66a, 66b, 66c respectively, and the control values 
66a, 66b, 66c are used for controlling the clock-gating 
units 60a, 60b, 60c to gate clock signals inputted into the 
first buffer units 62a, 62b, 62c. In addition, the pipeline 
control unit 54 outputs control signals COA, COB, GOC, 
NullA, NullB, NullC for controlling operations of the pro- 
cessing units 52a, 52b, 52c. For instance, when the 
pipeline control unit 54 delivers the control signal GOA to 
the processing unit 52a, it means that the calculation re- 
sult of the logic unit 58a needs to be piped to the next 
processing unit 52b. However, if the pipeline control unit 
54 outputs the control signal NullA to the processing unit 
52a, it means that the calculation result generated from 
the logic unit 58a needs to be nullified. Similarly, the con- 



trol signals GOB, GOC, NullB, NullC perform the same op- 
eration mentioned above, and repeated description is 
omitted for conciseness. The preferred embodiment, 
therefore, sets the control values 66a, 66b, 66c according 
to the control signals GOA, GOB, GOC, NullA, NullB, NullC 
corresponding to the processing units 52a, 52b, 52c. That 
is, the control value generator 56 generates the control 
signals Va", Vb", Vc" respectively to set corresponding 
control values 66a, 66b, 66c. Please note that the control 
signals Va", Vb", Vc" represent the updated control signals 
Va, Vb, Vc for setting the control values 66a, 66b, 66c. 
The detailed operation related to controlling the first 
buffer units 62a, 62b, 62c through the control values 66a, 
66b, 66c inputted into corresponding clock-gating units 
60a, 60b, 60c is described as follows. 
[0025] | n the preferred embodiment, the clock-gating units 60a, 
60b, 60c do not gate the clock signals generated by a 
clock generator from being inputted into corresponding 
first buffer units 62a, 62b, 62c if the control values 66a, 
66b, 66c hold the logic value "1". On the contrary, the 
clock-gating units 60a, 60b, 60c gate the clock signals 
from being inputted into corresponding first buffer units 
62a, 62b, 62c for reducing power consumption if the con- 



trol values 66a, 66b, 66c hold the logic value "0". Please 
note that the first buffer units 62a, 62b, 62c and the sec- 
ond buffer units 64a, 64b, 64c in the preferred embodi- 
ment are built using flip-flops, and the first buffer units 
62a, 62b, 62c and the second buffer units 64a, 64b, 64c 
can store data or output data according to the received 
clock signals. The circuit structures of the clock-gating 
units and the flip-flops are well known, and the lengthy 
description is skipped for simplicity. In addition, the first 
buffer units 62a, 62b, 62c and the second buffer units 
64a, 64b, 64c can be implemented by other circuits capa- 
ble of storing data and outputting data. 
[0026] Suppose that the logic value "1" has been assigned to the 
control values 66a, 66b, 66c in the processing units 52a, 
52b, 52c through the control signals Va", Vb", Vc". There- 
fore, the clock signals are passed to the first buffer units 
62a, 62b, 62c through the clock-gating units 60a, 60b, 
60c for driving the logic units 58a, 58b, 58c to generate 
calculation results respectively. Then, the pipeline control 
unit 54 starts setting logic values of the control signals 
GOA, GOB, GOC, NullA, NullB, NullC according to the de- 
fined rules for the processing units 52a, 52b, 52c. In the 
preferred embodiment, the control value generator 56 



generates the control signals Va", Vb", Vc" for updating 
the control values of the processing units 52a, 52b, 52c 
according to the control signals GOA, GOB, GOC, NullA, 
NullB, NullC, Va, Vb, Vc. The control signals Va, Vb, Vc in- 
putted into the control value generator 56 represent the 
control values 66a, 66b, 66c currently stored in the sec- 
ond buffer units 64a, 64b, 64c, and the control signal Va", 
Vb", Vc", outputted from the control value generator 56 
are used to update the control values 66a, 66b, 66c cur- 
rently stored in the second buffer units 64a, 64b, 64c. In 
other words, the control signals Va, Vb, Vc and the control 
signals Va", Vb", Vc" are identical. However, the control 
signals Va, Vb, Vc and the control signals Va", Vb", Vc" re- 
spectively represent the current control values and the 
next control values. 
[0027] For the control value 66b of the processing unit 52b, the 
rules for determining the control signal Vb" are listed be- 
low. 

[0028] Rule(l): if Va=l &GOA& !NullA, Vb=l 
[0029] Ru | e ( 2 ): if Vb=l & !GOA & !GOB & INullB, Vb"=l 
[0030] R U | e (3) : jf GOA & NullA, Vb=0 
[0031] Rule (4): if !GOA & GOB, Vb"=0 



[0032] Ru | e (5) : if Va =0 & GOA, Vb"=0 

[0033] The above rules are described as follows. For the rule (1), 
the control value 66a holds the logic value "1" so that the 
clock signal is successfully inputted into the first buffer 
unit 62a. Therefore, the first buffer unit 62a functions 
normally owing to the operative clock signal. In addition, 
the pipeline control unit 54 outputs the control signal 
GOA to the processing unit 52a for making the processing 
unit 52a pipe the calculation result to the next processing 
unit 52b. At this time, the pipeline control unit 54 does 
not output the control signal NullA to the processing unit 
52ayet for making the processing unit 52a nullify the cal- 
culation result. Therefore, the control value 66b is set to 
the logic value "1". In other words, the following first 
buffer unit 62b is enabled to start working according to 
the rule (1). 

[0034] For the rule (2), the control value 66b holds the logic 

value "1" so that the clock signal is successfully inputted 
into the first buffer unit 62b. Now, the first buffer unit 
62b functions normally owing to the operative clock sig- 
nal. In addition, the pipeline control unit 54 does not out- 
put the control signal GOA to the processing unit 52a yet 
for making the processing unit 52a pipe the calculation 



result to the next processing unit 52b, and the pipeline 
control unit 54 does not output the control signal GOB to 
the processing unit 52b yet for making the processing 
unit 52b pipe the calculation result to the next processing 
unit 52c. At the same time, the pipeline control unit 54 
does not output the control signal NullB to the processing 
unit 52b for making the processing unit 52b nullify the 
calculation result. Therefore, the control value 66b is not 
modified, and still keeps the original logic value "1". 
[0035] For the rule (3), the pipeline control unit 54 outputs the 

control signal NullA to the processing unit 52a for nullify- 
ing the calculation result of the processing unit 52a. That 
is, the output data of the logic unit 58a is cleared. At the 
same time, the pipeline control unit 54 generates the con- 
trol signal GOA for driving the processing unit 52a to pipe 
the calculation result to the next processing unit 52b. It is 
noteworthy that the calculation result of the processing 
unit 52a has been nullified. Therefore, the data inputted 
into the logic unit 58b is not valid, and the processing 
unit 52b does not need to activate the first buffer unit 62b 
for piping the calculation result of the logic unit 58b to 
the next processing unit 52c. The logic value "0" is as- 
signed to the control value 66b so that the clock-gating 



unit 60b gates the clock signal from being inputted into 
the first buffer unit 62b. 

[0036] For the rule (4), the pipeline control unit 54 outputs the 

control signal GOB to the processing unit 52b for control- 
ling the processing unit 52b to pipe the calculation result 
to the next processing unit 52c. However, the pipeline 
control unit 54 does not output the control signal GOA to 
the processing unit 52a yet. Therefore, the logic unit 58b 
of the processing unit 52b does not receive any input data 
used for calculating the calculation result, and no calcula- 
tion result needs to be passed to the next processing unit 
52c through the first buffer unit 62b. The logic value "0" 
is then assigned to the control value 66b so that the 
clock-gating unit 60b gates the clock signal from being 
inputted into the first buffer unit 62b. In other words, the 
first buffer unit 62 is unable to output any valid data to 
the processing unit 52c. 

[0037] For the rule (5), the clock signal originally inputted into 

the first buffer unit 62a is gated because the control value 
66a currently holds the logic value "0". Therefore, the first 
buffer unit 62a is unable to function normally for piping 
the calculation result of the processing unit 52a to the 
next processing unit 52b. Even though the pipeline con- 



trol unit 54 outputs the control signal GOA to the pro- 
cessing unit 52a, the processing unit 52a is still unable to 
pipe its calculation result to the next processing unit 52b. 
Concerning the processing unit 52b, the processing unit 
52b does not receive any valid input data for the logic unit 
58b. The first buffer unit 62b of the processing unit 52b, 
therefore, does not need to be driven by the clock signal 
for piping the calculation result of the logic unit 58b to 
the processing unit 52c. The logic value "0" is then as- 
signed to the control value 66b. 
[0038] The control values 66a, 66b, 66c of the processing units 
52a, 52b, 52c hold either the logic value "1" or the logic 
value "0" according to the above-listed rules. From the 
above description, the control value 66c of the processing 
unit 52c following the processing unit 52b must corre- 
spond to the logic value "0" through the rule (5) after the 
control value 66b of the processing unit 52b is set to the 
logic value "0" through the rule (3), the rule (4), or the rule 
(5). Therefore, when the processing unit 52c starts work- 
ing, the control value 66c drives the clock-gating unit 60c 
to gate the clock signal from being inputted into the first 
buffer unit 62c. 

[0039] | n the preferred embodiment, the second buffer units 64a, 



64b, 64c also need to be driven by clock signals for stor- 
ing control values 66a, 66b, 66c and outputting the con- 
trol values 66a, 66b, 66c to corresponding clock-gating 
units 60a, 60b, 60c. Though the second buffer units 64a, 
64b, 64c are added to the processing units 52a, 52b, 52c 
to keep the control values 66a, 66b, 66c used for control- 
ling clock signals inputted into the first buffer units 62a, 
62b, 62c. Compared with the power consumption of a 
system clock generator continuously outputting clock sig- 
nals to drive the idle first buffer units 62a, 62b, 62c, the 
power consumption of the system clock generator contin- 
uously outputting the clock signals to drive the second 
buffer units 64a, 64b, 64c is negligible. For example, the 
logic unit 58 generates the calculation result having a bit 
length equaling 64. The first buffer unit 62a, therefore, 
requires 64 flip-flops to handle the calculation result cor- 
rectly. However, only one flip-flop is needed to keep the 
control value 66a. To sum up, the preferred embodiment 
having these additional second buffer units 64a, 64b, 64c 
does not greatly raise the power consumption of the sys- 
tem clock generator. Actually, the preferred embodiment 
not only reduces the power consumption, but also allows 
operation of the clock-gating unit to conform to the limi- 



tation of the well-known clock-gating setup time. 
[0040] | n addition, the preferred embodiment uses the control 
values 66a, 66b, 66c to drive the clock-gating units 60a, 
60b, 60c. The control value 66c of the processing unit 52c 
following the processing unit 52b certainly corresponds to 
the logic value "0" through the rule (5) after the control 
value 66b of the processing unit 52b is set to the logic 
value "0" through the rule (3), the rule (4), or the rule (5). 
Therefore, when the processing unit 52c starts working, 
the control value 66c drives the clock-gating unit 60c to 
gate the clock signal from being inputted into the first 
buffer unit 62c. In other words, the power consumption of 
the system clock generator is quickly reduced when the 
processing unit 52c starts working. The time wasted for 
waiting a pipeline control unit of a running prior process- 
ing unit to finish delivering a wanted control signal to a 
clock-gating unit for gating a clock signal is cut down 
now with the help of the claimed control values. That is, 
with the implementation of the control values, the pre- 
ferred embodiment is capable of transmitting the control 
values to the clock-gating units when the corresponding 
processing units start working. Therefore, the preferred 
embodiment can prevent the control signal delayed by the 



pipeline control unit from affecting the operation of the 
clock-gating unit. That is, the preferred embodiment is 
capable of gating clock signals correctly for saving power, 
and is capable of effectively improving cycle time of the 
pipeline-based circuit 50. 
[0041] please note that the preferred embodiment only utilizes 
one pipeline to illustrate its operation. However, the 
pipeline-based circuit 50 can be built according to a su- 
per-scalar structure or a super-pipeline structure. In ad- 
dition, the pipeline-based circuit 50 still can utilize the 
control values to control corresponding clock-gating 
units. Taking a super-scalar structure with a plurality of 
pipelines for example, a plurality of processing units are 
located at each pipeline. Therefore, a plurality of buffer 
units used for storing control values are then imple- 
mented according to a total number of the processing 
units. With the above-mentioned rules for determining the 
control values, the control values are capable of achieving 
the goal of saving power through correctly gating clock 
signals. To sum up, the claimed control values can be 
easily implemented on the pipeline-based circuits 50 hav- 
ing different pipeline structures. Therefore, the pipeline- 
based circuit according to the present invention has great 



scalability. 

[0042] | n contrast to the prior art, the claimed pipeline-based 
circuit utilizes second buffer units to store control values 
used for controlling clock-gating units. When one pro- 
cessing unit does not need to pipe a calculation result to a 
next stage, the related control value is set by a predeter- 
mined logic value so that the clock signal is gated when 
the processing unit starts working. In other words, after 
the processing unit starts working, the operation of gating 
the clock signal for reducing power consumption associ- 
ated with the clock signal is completed within a prior art 
clock-gating setup time. Furthermore, when following 
processing units start working, the corresponding control 
values are set by the same predetermined value according 
to the above-mentioned claimed rules for gating the clock 
signals inputted into the first buffer units of the following 
processing units. Therefore, the claimed pipeline-based 
circuit can prevent the late-arrived control signal from af- 
fecting the operation of the clock-gating unit. That is, the 
claimed pipeline-based circuit is capable of gating clock 
signals in time for saving power successfully. In addition, 
the claimed pipeline-based circuit only requires additional 
second buffer units. The circuit structure of the second 



buffer unit is simple, and the implementation is easy. 
Therefore, the control values can be easily applied to 
pipeline-based circuits having different pipeline struc- 
tures for accomplishing the same purpose of reducing 
power consumption. 
[0043] Those skilled in the art will readily observe that numerous 
modifications and alterations of the device may be made 
while retaining the teachings of the invention. Accord- 
ingly, the above disclosure should be construed as limited 
only by the metes and bounds of the appended claims. 



